Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 3116945 |
| Missing cells | 15868508 |
| Missing cells (%) | 23.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.4 GiB |
| Average record size in memory | 840.4 B |
Variable types
| Numeric | 4 |
|---|---|
| Categorical | 10 |
| Text | 8 |
cap-diameter is highly overall correlated with stem-height and 1 other fields | High correlation |
class is highly overall correlated with stem-root | High correlation |
stem-height is highly overall correlated with cap-diameter | High correlation |
stem-root is highly overall correlated with class | High correlation |
stem-width is highly overall correlated with cap-diameter | High correlation |
does-bruise-or-bleed is highly imbalanced (85.7%) | Imbalance |
gill-spacing is highly imbalanced (80.6%) | Imbalance |
stem-root is highly imbalanced (66.8%) | Imbalance |
veil-type is highly imbalanced (99.8%) | Imbalance |
veil-color is highly imbalanced (69.8%) | Imbalance |
has-ring is highly imbalanced (82.4%) | Imbalance |
ring-type is highly imbalanced (79.3%) | Imbalance |
spore-print-color is highly imbalanced (56.6%) | Imbalance |
cap-surface has 671023 (21.5%) missing values | Missing |
gill-attachment has 523936 (16.8%) missing values | Missing |
gill-spacing has 1258435 (40.4%) missing values | Missing |
stem-root has 2757023 (88.5%) missing values | Missing |
stem-surface has 1980861 (63.6%) missing values | Missing |
veil-type has 2957493 (94.9%) missing values | Missing |
veil-color has 2740947 (87.9%) missing values | Missing |
ring-type has 128880 (4.1%) missing values | Missing |
spore-print-color has 2849682 (91.4%) missing values | Missing |
id is uniformly distributed | Uniform |
id has unique values | Unique |
Reproduction
| Analysis started | 2024-09-04 09:09:27.286239 |
|---|---|
| Analysis finished | 2024-09-04 09:12:31.396761 |
| Duration | 3 minutes and 4.11 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 3116945 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1558472 |
| Minimum | 0 |
|---|---|
| Maximum | 3116944 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 155847.2 |
| Q1 | 779236 |
| median | 1558472 |
| Q3 | 2337708 |
| 95-th percentile | 2961096.8 |
| Maximum | 3116944 |
| Range | 3116944 |
| Interquartile range (IQR) | 1558472 |
Descriptive statistics
| Standard deviation | 899784.66 |
|---|---|
| Coefficient of variation (CV) | 0.57735055 |
| Kurtosis | -1.2 |
| Mean | 1558472 |
| Median Absolute Deviation (MAD) | 779236 |
| Skewness | -2.5075827 × 10-15 |
| Sum | 4.8576715 × 1012 |
| Variance | 8.0961244 × 1011 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2077967 | 1 | < 0.1% |
| 2077958 | 1 | < 0.1% |
| 2077959 | 1 | < 0.1% |
| 2077960 | 1 | < 0.1% |
| 2077961 | 1 | < 0.1% |
| 2077962 | 1 | < 0.1% |
| 2077963 | 1 | < 0.1% |
| 2077964 | 1 | < 0.1% |
| 2077965 | 1 | < 0.1% |
| Other values (3116935) | 3116935 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 3116944 | 1 | |
| 3116943 | 1 | |
| 3116942 | 1 | |
| 3116941 | 1 | |
| 3116940 | 1 | |
| 3116939 | 1 | |
| 3116938 | 1 | |
| 3116937 | 1 | |
| 3116936 | 1 | |
| 3116935 | 1 |
class
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 148.6 MiB |
| p | |
|---|---|
| e |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3116945 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | e |
|---|---|
| 2nd row | p |
| 3rd row | e |
| 4th row | e |
| 5th row | e |
Common Values
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| p | 1705396 | |
| e | 1411549 |
cap-diameter
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3913 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3098484 |
| Minimum | 0.03 |
|---|---|
| Maximum | 80.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.8 MiB |
Quantile statistics
| Minimum | 0.03 |
|---|---|
| 5-th percentile | 1.34 |
| Q1 | 3.32 |
| median | 5.75 |
| Q3 | 8.24 |
| 95-th percentile | 13.23 |
| Maximum | 80.67 |
| Range | 80.64 |
| Interquartile range (IQR) | 4.92 |
Descriptive statistics
| Standard deviation | 4.6579305 |
|---|---|
| Coefficient of variation (CV) | 0.73820007 |
| Kurtosis | 32.743381 |
| Mean | 6.3098484 |
| Median Absolute Deviation (MAD) | 2.46 |
| Skewness | 3.9726092 |
| Sum | 19667425 |
| Variance | 21.696317 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.49 | 8164 | 0.3% |
| 3.18 | 7942 | 0.3% |
| 3.14 | 7361 | 0.2% |
| 1.51 | 7072 | 0.2% |
| 4.04 | 6828 | 0.2% |
| 3.28 | 6826 | 0.2% |
| 2.87 | 6807 | 0.2% |
| 3.85 | 6642 | 0.2% |
| 3.24 | 6634 | 0.2% |
| 1.52 | 6562 | 0.2% |
| Other values (3903) | 3046103 |
| Value | Count | Frequency (%) |
| 0.03 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.38 | 1 | < 0.1% |
| 0.4 | 6 | < 0.1% |
| 0.41 | 2 | < 0.1% |
| 0.42 | 3 | < 0.1% |
| 0.44 | 15 | |
| 0.45 | 2 | < 0.1% |
| 0.46 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 80.67 | 1 | |
| 64.46 | 1 | |
| 62.4 | 1 | |
| 62.3 | 1 | |
| 62.06 | 1 | |
| 62.01 | 1 | |
| 60.97 | 1 | |
| 59.76 | 1 | |
| 59.74 | 2 | |
| 59.66 | 1 |
cap-shape
Text
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 40 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000536 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3117072 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 47 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | f |
|---|---|
| 2nd row | x |
| 3rd row | f |
| 4th row | f |
| 5th row | x |
| Value | Count | Frequency (%) |
| x | 1436030 | |
| f | 676240 | |
| s | 365147 | 11.7% |
| b | 318647 | 10.2% |
| o | 108835 | 3.5% |
| p | 106968 | 3.4% |
| c | 104520 | 3.4% |
| d | 65 | < 0.1% |
| e | 60 | < 0.1% |
| n | 41 | < 0.1% |
| Other values (62) | 360 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| x | 1436030 | |
| f | 676240 | |
| s | 365149 | 11.7% |
| b | 318647 | 10.2% |
| o | 108835 | 3.5% |
| p | 106969 | 3.4% |
| c | 104520 | 3.4% |
| d | 65 | < 0.1% |
| e | 61 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (26) | 512 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3117072 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| x | 1436030 | |
| f | 676240 | |
| s | 365149 | 11.7% |
| b | 318647 | 10.2% |
| o | 108835 | 3.5% |
| p | 106969 | 3.4% |
| c | 104520 | 3.4% |
| d | 65 | < 0.1% |
| e | 61 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (26) | 512 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3117072 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| x | 1436030 | |
| f | 676240 | |
| s | 365149 | 11.7% |
| b | 318647 | 10.2% |
| o | 108835 | 3.5% |
| p | 106969 | 3.4% |
| c | 104520 | 3.4% |
| d | 65 | < 0.1% |
| e | 61 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (26) | 512 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3117072 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| x | 1436030 | |
| f | 676240 | |
| s | 365149 | 11.7% |
| b | 318647 | 10.2% |
| o | 108835 | 3.5% |
| p | 106969 | 3.4% |
| c | 104520 | 3.4% |
| d | 65 | < 0.1% |
| e | 61 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (26) | 512 | < 0.1% |
cap-surface
Text
MISSING 
| Distinct | 83 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 671023 |
| Missing (%) | 21.5% |
| Memory size | 137.1 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001402 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2446265 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | s |
|---|---|
| 2nd row | h |
| 3rd row | s |
| 4th row | y |
| 5th row | l |
| Value | Count | Frequency (%) |
| t | 460779 | |
| s | 384970 | |
| y | 327827 | |
| h | 284463 | |
| g | 263729 | |
| d | 206832 | |
| k | 128876 | 5.3% |
| e | 119712 | 4.9% |
| i | 113440 | 4.6% |
| w | 109840 | 4.5% |
| Other values (68) | 45465 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 460785 | |
| s | 385005 | |
| y | 327831 | |
| h | 284466 | |
| g | 263735 | |
| d | 206841 | |
| k | 128876 | 5.3% |
| e | 119741 | 4.9% |
| i | 113454 | 4.6% |
| w | 109840 | 4.5% |
| Other values (28) | 45691 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2446265 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 460785 | |
| s | 385005 | |
| y | 327831 | |
| h | 284466 | |
| g | 263735 | |
| d | 206841 | |
| k | 128876 | 5.3% |
| e | 119741 | 4.9% |
| i | 113454 | 4.6% |
| w | 109840 | 4.5% |
| Other values (28) | 45691 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2446265 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 460785 | |
| s | 385005 | |
| y | 327831 | |
| h | 284466 | |
| g | 263735 | |
| d | 206841 | |
| k | 128876 | 5.3% |
| e | 119741 | 4.9% |
| i | 113454 | 4.6% |
| w | 109840 | 4.5% |
| Other values (28) | 45691 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2446265 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 460785 | |
| s | 385005 | |
| y | 327831 | |
| h | 284466 | |
| g | 263735 | |
| d | 206841 | |
| k | 128876 | 5.3% |
| e | 119741 | 4.9% |
| i | 113454 | 4.6% |
| w | 109840 | 4.5% |
| Other values (28) | 45691 | 1.9% |
cap-color
Text
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001011 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3117248 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 49 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | u |
|---|---|
| 2nd row | o |
| 3rd row | b |
| 4th row | g |
| 5th row | w |
| Value | Count | Frequency (%) |
| n | 1359544 | |
| y | 386627 | 12.4% |
| w | 379442 | 12.2% |
| g | 210825 | 6.8% |
| e | 197290 | 6.3% |
| o | 178847 | 5.7% |
| p | 91838 | 2.9% |
| r | 78236 | 2.5% |
| u | 73172 | 2.3% |
| b | 61313 | 2.0% |
| Other values (68) | 99801 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1359556 | |
| y | 386633 | 12.4% |
| w | 379442 | 12.2% |
| g | 210831 | 6.8% |
| e | 197314 | 6.3% |
| o | 178860 | 5.7% |
| p | 91844 | 2.9% |
| r | 78248 | 2.5% |
| u | 73175 | 2.3% |
| b | 61317 | 2.0% |
| Other values (27) | 100028 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3117248 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 1359556 | |
| y | 386633 | 12.4% |
| w | 379442 | 12.2% |
| g | 210831 | 6.8% |
| e | 197314 | 6.3% |
| o | 178860 | 5.7% |
| p | 91844 | 2.9% |
| r | 78248 | 2.5% |
| u | 73175 | 2.3% |
| b | 61317 | 2.0% |
| Other values (27) | 100028 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3117248 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 1359556 | |
| y | 386633 | 12.4% |
| w | 379442 | 12.2% |
| g | 210831 | 6.8% |
| e | 197314 | 6.3% |
| o | 178860 | 5.7% |
| p | 91844 | 2.9% |
| r | 78248 | 2.5% |
| u | 73175 | 2.3% |
| b | 61317 | 2.0% |
| Other values (27) | 100028 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3117248 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 1359556 | |
| y | 386633 | 12.4% |
| w | 379442 | 12.2% |
| g | 210831 | 6.8% |
| e | 197314 | 6.3% |
| o | 178860 | 5.7% |
| p | 91844 | 2.9% |
| r | 78248 | 2.5% |
| u | 73175 | 2.3% |
| b | 61317 | 2.0% |
| Other values (27) | 100028 | 3.2% |
does-bruise-or-bleed
Categorical
IMBALANCE 
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
| f | |
|---|---|
| t | |
| w | 14 |
| c | 11 |
| h | 9 |
| Other values (21) | 75 |
Length
| Max length | 8 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000048 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3116952 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | f |
|---|---|
| 2nd row | f |
| 3rd row | f |
| 4th row | f |
| 5th row | f |
Common Values
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 9 | < 0.1% |
| y | 7 | < 0.1% |
| a | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 6 | < 0.1% |
| Other values (16) | 41 | < 0.1% |
| (Missing) | 8 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 9 | < 0.1% |
| y | 7 | < 0.1% |
| a | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 6 | < 0.1% |
| Other values (16) | 41 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 10 | < 0.1% |
| a | 8 | < 0.1% |
| y | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 7 | < 0.1% |
| Other values (18) | 53 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3116952 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 10 | < 0.1% |
| a | 8 | < 0.1% |
| y | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 7 | < 0.1% |
| Other values (18) | 53 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3116952 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 10 | < 0.1% |
| a | 8 | < 0.1% |
| y | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 7 | < 0.1% |
| Other values (18) | 53 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3116952 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| f | 2569743 | |
| t | 547085 | 17.6% |
| w | 14 | < 0.1% |
| c | 11 | < 0.1% |
| h | 10 | < 0.1% |
| a | 8 | < 0.1% |
| y | 7 | < 0.1% |
| b | 7 | < 0.1% |
| x | 7 | < 0.1% |
| s | 7 | < 0.1% |
| Other values (18) | 53 | < 0.1% |
gill-attachment
Text
MISSING 
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 523936 |
| Missing (%) | 16.8% |
| Memory size | 139.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000891 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2593240 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 53 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | a |
|---|---|
| 2nd row | a |
| 3rd row | x |
| 4th row | s |
| 5th row | d |
| Value | Count | Frequency (%) |
| a | 646035 | |
| d | 589237 | |
| x | 360878 | |
| e | 301858 | |
| s | 295439 | |
| p | 279112 | |
| f | 119956 | 4.6% |
| c | 74 | < 0.1% |
| u | 56 | < 0.1% |
| w | 37 | < 0.1% |
| Other values (64) | 334 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 646042 | |
| d | 589242 | |
| x | 360878 | |
| e | 301872 | |
| s | 295458 | |
| p | 279113 | |
| f | 119956 | 4.6% |
| c | 74 | < 0.1% |
| u | 57 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (27) | 504 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2593240 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 646042 | |
| d | 589242 | |
| x | 360878 | |
| e | 301872 | |
| s | 295458 | |
| p | 279113 | |
| f | 119956 | 4.6% |
| c | 74 | < 0.1% |
| u | 57 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (27) | 504 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2593240 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 646042 | |
| d | 589242 | |
| x | 360878 | |
| e | 301872 | |
| s | 295458 | |
| p | 279113 | |
| f | 119956 | 4.6% |
| c | 74 | < 0.1% |
| u | 57 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (27) | 504 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2593240 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 646042 | |
| d | 589242 | |
| x | 360878 | |
| e | 301872 | |
| s | 295458 | |
| p | 279113 | |
| f | 119956 | 4.6% |
| c | 74 | < 0.1% |
| u | 57 | < 0.1% |
| . | 44 | < 0.1% |
| Other values (27) | 504 | < 0.1% |
gill-spacing
Categorical
IMBALANCE  MISSING 
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1258435 |
| Missing (%) | 40.4% |
| Memory size | 155.8 MiB |
| c | |
|---|---|
| d | |
| f | 119380 |
| e | 24 |
| a | 17 |
| Other values (43) | 103 |
Length
| Max length | 11 |
|---|---|
| Median length | 1 |
| Mean length | 1.00005 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1858603 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | c |
|---|---|
| 2nd row | c |
| 3rd row | c |
| 4th row | c |
| 5th row | c |
Common Values
| Value | Count | Frequency (%) |
| c | 1331054 | |
| d | 407932 | 13.1% |
| f | 119380 | 3.8% |
| e | 24 | < 0.1% |
| a | 17 | < 0.1% |
| s | 16 | < 0.1% |
| b | 12 | < 0.1% |
| t | 8 | < 0.1% |
| x | 8 | < 0.1% |
| p | 7 | < 0.1% |
| Other values (38) | 52 | < 0.1% |
| (Missing) | 1258435 |
Length
| Value | Count | Frequency (%) |
| c | 1331054 | |
| d | 407932 | 21.9% |
| f | 119381 | 6.4% |
| e | 24 | < 0.1% |
| a | 17 | < 0.1% |
| s | 16 | < 0.1% |
| b | 12 | < 0.1% |
| t | 8 | < 0.1% |
| x | 8 | < 0.1% |
| p | 7 | < 0.1% |
| Other values (38) | 52 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 1331057 | |
| d | 407933 | 21.9% |
| f | 119382 | 6.4% |
| e | 26 | < 0.1% |
| . | 25 | < 0.1% |
| s | 20 | < 0.1% |
| a | 20 | < 0.1% |
| b | 12 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 10 | < 0.1% |
| Other values (24) | 108 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1858603 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 1331057 | |
| d | 407933 | 21.9% |
| f | 119382 | 6.4% |
| e | 26 | < 0.1% |
| . | 25 | < 0.1% |
| s | 20 | < 0.1% |
| a | 20 | < 0.1% |
| b | 12 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 10 | < 0.1% |
| Other values (24) | 108 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1858603 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 1331057 | |
| d | 407933 | 21.9% |
| f | 119382 | 6.4% |
| e | 26 | < 0.1% |
| . | 25 | < 0.1% |
| s | 20 | < 0.1% |
| a | 20 | < 0.1% |
| b | 12 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 10 | < 0.1% |
| Other values (24) | 108 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1858603 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 1331057 | |
| d | 407933 | 21.9% |
| f | 119382 | 6.4% |
| e | 26 | < 0.1% |
| . | 25 | < 0.1% |
| s | 20 | < 0.1% |
| a | 20 | < 0.1% |
| b | 12 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 10 | < 0.1% |
| Other values (24) | 108 | < 0.1% |
gill-color
Text
| Distinct | 63 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 57 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001078 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3117224 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 31 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | w |
|---|---|
| 2nd row | n |
| 3rd row | w |
| 4th row | g |
| 5th row | w |
| Value | Count | Frequency (%) |
| w | 931539 | |
| n | 543387 | |
| y | 469466 | |
| p | 343626 | 11.0% |
| g | 212164 | 6.8% |
| o | 157119 | 5.0% |
| k | 127970 | 4.1% |
| f | 119694 | 3.8% |
| r | 62799 | 2.0% |
| e | 56048 | 1.8% |
| Other values (51) | 93080 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 931539 | |
| n | 543409 | |
| y | 469472 | |
| p | 343642 | 11.0% |
| g | 212176 | 6.8% |
| o | 157141 | 5.0% |
| k | 127970 | 4.1% |
| f | 119694 | 3.8% |
| r | 62819 | 2.0% |
| e | 56072 | 1.8% |
| Other values (27) | 93290 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3117224 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| w | 931539 | |
| n | 543409 | |
| y | 469472 | |
| p | 343642 | 11.0% |
| g | 212176 | 6.8% |
| o | 157141 | 5.0% |
| k | 127970 | 4.1% |
| f | 119694 | 3.8% |
| r | 62819 | 2.0% |
| e | 56072 | 1.8% |
| Other values (27) | 93290 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3117224 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| w | 931539 | |
| n | 543409 | |
| y | 469472 | |
| p | 343642 | 11.0% |
| g | 212176 | 6.8% |
| o | 157141 | 5.0% |
| k | 127970 | 4.1% |
| f | 119694 | 3.8% |
| r | 62819 | 2.0% |
| e | 56072 | 1.8% |
| Other values (27) | 93290 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3117224 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| w | 931539 | |
| n | 543409 | |
| y | 469472 | |
| p | 343642 | 11.0% |
| g | 212176 | 6.8% |
| o | 157141 | 5.0% |
| k | 127970 | 4.1% |
| f | 119694 | 3.8% |
| r | 62819 | 2.0% |
| e | 56072 | 1.8% |
| Other values (27) | 93290 | 3.0% |
stem-height
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2749 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3483333 |
| Minimum | 0 |
|---|---|
| Maximum | 88.72 |
| Zeros | 554 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.16 |
| Q1 | 4.67 |
| median | 5.88 |
| Q3 | 7.41 |
| 95-th percentile | 11.2 |
| Maximum | 88.72 |
| Range | 88.72 |
| Interquartile range (IQR) | 2.74 |
Descriptive statistics
| Standard deviation | 2.6997548 |
|---|---|
| Coefficient of variation (CV) | 0.42526985 |
| Kurtosis | 7.7615498 |
| Mean | 6.3483333 |
| Median Absolute Deviation (MAD) | 1.33 |
| Skewness | 1.9266817 |
| Sum | 19787406 |
| Variance | 7.288676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.24 | 12332 | 0.4% |
| 5.92 | 11821 | 0.4% |
| 5.32 | 10988 | 0.4% |
| 5.35 | 10431 | 0.3% |
| 5.99 | 10402 | 0.3% |
| 6.03 | 10271 | 0.3% |
| 5.54 | 10265 | 0.3% |
| 5.77 | 10153 | 0.3% |
| 4.27 | 10080 | 0.3% |
| 5.65 | 9994 | 0.3% |
| Other values (2739) | 3010208 |
| Value | Count | Frequency (%) |
| 0 | 554 | |
| 0.74 | 1 | < 0.1% |
| 0.77 | 1 | < 0.1% |
| 0.91 | 1 | < 0.1% |
| 0.93 | 1 | < 0.1% |
| 0.97 | 2 | < 0.1% |
| 0.98 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.01 | 1 | < 0.1% |
| 1.03 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 88.72 | 1 | |
| 57.22 | 1 | |
| 53.93 | 1 | |
| 53.87 | 1 | |
| 53.82 | 1 | |
| 53.03 | 1 | |
| 51.41 | 1 | |
| 50.78 | 1 | |
| 50.27 | 1 | |
| 49.37 | 1 |
stem-width
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 5836 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.153785 |
| Minimum | 0 |
|---|---|
| Maximum | 102.9 |
| Zeros | 497 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.58 |
| Q1 | 4.97 |
| median | 9.65 |
| Q3 | 15.63 |
| 95-th percentile | 26.49 |
| Maximum | 102.9 |
| Range | 102.9 |
| Interquartile range (IQR) | 10.66 |
Descriptive statistics
| Standard deviation | 8.0954773 |
|---|---|
| Coefficient of variation (CV) | 0.72580538 |
| Kurtosis | 2.4489761 |
| Mean | 11.153785 |
| Median Absolute Deviation (MAD) | 5.24 |
| Skewness | 1.2354271 |
| Sum | 34765735 |
| Variance | 65.536753 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.41 | 7829 | 0.3% |
| 2.45 | 7353 | 0.2% |
| 2.49 | 7087 | 0.2% |
| 2.56 | 6824 | 0.2% |
| 2.47 | 6709 | 0.2% |
| 2.52 | 6660 | 0.2% |
| 2.51 | 6535 | 0.2% |
| 2.64 | 6467 | 0.2% |
| 2.6 | 6366 | 0.2% |
| 2.61 | 6117 | 0.2% |
| Other values (5826) | 3048998 |
| Value | Count | Frequency (%) |
| 0 | 497 | |
| 0.44 | 2 | < 0.1% |
| 0.48 | 2 | < 0.1% |
| 0.49 | 1 | < 0.1% |
| 0.5 | 3 | < 0.1% |
| 0.51 | 1 | < 0.1% |
| 0.52 | 21 | < 0.1% |
| 0.53 | 16 | < 0.1% |
| 0.54 | 11 | < 0.1% |
| 0.55 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 102.9 | 1 | < 0.1% |
| 102.48 | 6 | |
| 101.69 | 3 | |
| 98 | 2 | < 0.1% |
| 94.24 | 3 | |
| 94.05 | 1 | < 0.1% |
| 92.51 | 1 | < 0.1% |
| 91.91 | 1 | < 0.1% |
| 91 | 1 | < 0.1% |
| 89.45 | 2 | < 0.1% |
stem-root
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2757023 |
| Missing (%) | 88.5% |
| Memory size | 164.4 MiB |
| b | |
|---|---|
| s | |
| r | |
| c | |
| f | 597 |
| Other values (33) | 183 |
Length
| Max length | 17 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001778 |
| Min length | 1 |
Characters and Unicode
| Total characters | 359986 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | b |
|---|---|
| 2nd row | b |
| 3rd row | c |
| 4th row | b |
| 5th row | r |
Common Values
| Value | Count | Frequency (%) |
| b | 165801 | 5.3% |
| s | 116946 | 3.8% |
| r | 47803 | 1.5% |
| c | 28592 | 0.9% |
| f | 597 | < 0.1% |
| d | 24 | < 0.1% |
| y | 14 | < 0.1% |
| g | 12 | < 0.1% |
| p | 12 | < 0.1% |
| w | 12 | < 0.1% |
| Other values (28) | 109 | < 0.1% |
| (Missing) | 2757023 |
Length
| Value | Count | Frequency (%) |
| b | 165801 | |
| s | 116946 | |
| r | 47803 | 13.3% |
| c | 28592 | 7.9% |
| f | 597 | 0.2% |
| d | 24 | < 0.1% |
| y | 14 | < 0.1% |
| p | 12 | < 0.1% |
| w | 12 | < 0.1% |
| g | 12 | < 0.1% |
| Other values (28) | 109 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 165801 | |
| s | 116947 | |
| r | 47806 | 13.3% |
| c | 28593 | 7.9% |
| f | 597 | 0.2% |
| d | 24 | < 0.1% |
| p | 14 | < 0.1% |
| . | 14 | < 0.1% |
| y | 14 | < 0.1% |
| g | 12 | < 0.1% |
| Other values (25) | 164 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 359986 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| b | 165801 | |
| s | 116947 | |
| r | 47806 | 13.3% |
| c | 28593 | 7.9% |
| f | 597 | 0.2% |
| d | 24 | < 0.1% |
| p | 14 | < 0.1% |
| . | 14 | < 0.1% |
| y | 14 | < 0.1% |
| g | 12 | < 0.1% |
| Other values (25) | 164 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 359986 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| b | 165801 | |
| s | 116947 | |
| r | 47806 | 13.3% |
| c | 28593 | 7.9% |
| f | 597 | 0.2% |
| d | 24 | < 0.1% |
| p | 14 | < 0.1% |
| . | 14 | < 0.1% |
| y | 14 | < 0.1% |
| g | 12 | < 0.1% |
| Other values (25) | 164 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 359986 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| b | 165801 | |
| s | 116947 | |
| r | 47806 | 13.3% |
| c | 28593 | 7.9% |
| f | 597 | 0.2% |
| d | 24 | < 0.1% |
| p | 14 | < 0.1% |
| . | 14 | < 0.1% |
| y | 14 | < 0.1% |
| g | 12 | < 0.1% |
| Other values (25) | 164 | < 0.1% |
stem-surface
Text
MISSING 
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1980861 |
| Missing (%) | 63.6% |
| Memory size | 114.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001919 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1136302 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | y |
|---|---|
| 2nd row | s |
| 3rd row | s |
| 4th row | t |
| 5th row | s |
| Value | Count | Frequency (%) |
| s | 327611 | |
| y | 255500 | |
| i | 224346 | |
| t | 147974 | |
| g | 78080 | 6.9% |
| k | 73383 | 6.5% |
| h | 28284 | 2.5% |
| f | 512 | < 0.1% |
| w | 49 | < 0.1% |
| d | 48 | < 0.1% |
| Other values (50) | 300 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 327634 | |
| y | 255500 | |
| i | 224350 | |
| t | 147975 | |
| g | 78081 | 6.9% |
| k | 73383 | 6.5% |
| h | 28286 | 2.5% |
| f | 512 | < 0.1% |
| d | 54 | < 0.1% |
| e | 54 | < 0.1% |
| Other values (27) | 473 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1136302 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 327634 | |
| y | 255500 | |
| i | 224350 | |
| t | 147975 | |
| g | 78081 | 6.9% |
| k | 73383 | 6.5% |
| h | 28286 | 2.5% |
| f | 512 | < 0.1% |
| d | 54 | < 0.1% |
| e | 54 | < 0.1% |
| Other values (27) | 473 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1136302 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 327634 | |
| y | 255500 | |
| i | 224350 | |
| t | 147975 | |
| g | 78081 | 6.9% |
| k | 73383 | 6.5% |
| h | 28286 | 2.5% |
| f | 512 | < 0.1% |
| d | 54 | < 0.1% |
| e | 54 | < 0.1% |
| Other values (27) | 473 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1136302 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 327634 | |
| y | 255500 | |
| i | 224350 | |
| t | 147975 | |
| g | 78081 | 6.9% |
| k | 73383 | 6.5% |
| h | 28286 | 2.5% |
| f | 512 | < 0.1% |
| d | 54 | < 0.1% |
| e | 54 | < 0.1% |
| Other values (27) | 473 | < 0.1% |
stem-color
Text
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 38 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000539 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3117075 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | w |
|---|---|
| 2nd row | o |
| 3rd row | n |
| 4th row | w |
| 5th row | w |
| Value | Count | Frequency (%) |
| w | 1196638 | |
| n | 1003466 | |
| y | 373971 | 12.0% |
| g | 132019 | 4.2% |
| o | 111541 | 3.6% |
| e | 103374 | 3.3% |
| u | 67017 | 2.2% |
| p | 54690 | 1.8% |
| k | 33676 | 1.1% |
| r | 22329 | 0.7% |
| Other values (47) | 18189 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 1196638 | |
| n | 1003471 | |
| y | 373974 | 12.0% |
| g | 132022 | 4.2% |
| o | 111547 | 3.6% |
| e | 103379 | 3.3% |
| u | 67017 | 2.1% |
| p | 54697 | 1.8% |
| k | 33676 | 1.1% |
| r | 22338 | 0.7% |
| Other values (26) | 18316 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3117075 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| w | 1196638 | |
| n | 1003471 | |
| y | 373974 | 12.0% |
| g | 132022 | 4.2% |
| o | 111547 | 3.6% |
| e | 103379 | 3.3% |
| u | 67017 | 2.1% |
| p | 54697 | 1.8% |
| k | 33676 | 1.1% |
| r | 22338 | 0.7% |
| Other values (26) | 18316 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3117075 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| w | 1196638 | |
| n | 1003471 | |
| y | 373974 | 12.0% |
| g | 132022 | 4.2% |
| o | 111547 | 3.6% |
| e | 103379 | 3.3% |
| u | 67017 | 2.1% |
| p | 54697 | 1.8% |
| k | 33676 | 1.1% |
| r | 22338 | 0.7% |
| Other values (26) | 18316 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3117075 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| w | 1196638 | |
| n | 1003471 | |
| y | 373974 | 12.0% |
| g | 132022 | 4.2% |
| o | 111547 | 3.6% |
| e | 103379 | 3.3% |
| u | 67017 | 2.1% |
| p | 54697 | 1.8% |
| k | 33676 | 1.1% |
| r | 22338 | 0.7% |
| Other values (26) | 18316 | 0.6% |
veil-type
Categorical
IMBALANCE  MISSING 
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2957493 |
| Missing (%) | 94.9% |
| Memory size | 165.6 MiB |
| u | |
|---|---|
| w | 11 |
| a | 9 |
| e | 8 |
| f | 8 |
| Other values (17) | 43 |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000815 |
| Min length | 1 |
Characters and Unicode
| Total characters | 159465 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | u |
|---|---|
| 2nd row | u |
| 3rd row | u |
| 4th row | u |
| 5th row | u |
Common Values
| Value | Count | Frequency (%) |
| u | 159373 | 5.1% |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 8 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (12) | 21 | < 0.1% |
| (Missing) | 2957493 |
Length
| Value | Count | Frequency (%) |
| u | 159373 | |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 8 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (13) | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 159373 | |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 9 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (18) | 33 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 159465 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| u | 159373 | |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 9 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (18) | 33 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 159465 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| u | 159373 | |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 9 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (18) | 33 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 159465 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| u | 159373 | |
| w | 11 | < 0.1% |
| a | 9 | < 0.1% |
| e | 9 | < 0.1% |
| f | 8 | < 0.1% |
| b | 5 | < 0.1% |
| c | 5 | < 0.1% |
| g | 4 | < 0.1% |
| y | 4 | < 0.1% |
| k | 4 | < 0.1% |
| Other values (18) | 33 | < 0.1% |
veil-color
Categorical
IMBALANCE  MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2740947 |
| Missing (%) | 87.9% |
| Memory size | 164.3 MiB |
| w | |
|---|---|
| y | |
| n | |
| u | 14026 |
| k | 13080 |
| Other values (19) | 9310 |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000239 |
| Min length | 1 |
Characters and Unicode
| Total characters | 376007 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | n |
|---|---|
| 2nd row | w |
| 3rd row | w |
| 4th row | w |
| 5th row | n |
Common Values
| Value | Count | Frequency (%) |
| w | 279070 | 9.0% |
| y | 30473 | 1.0% |
| n | 30039 | 1.0% |
| u | 14026 | 0.4% |
| k | 13080 | 0.4% |
| e | 9169 | 0.3% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (14) | 61 | < 0.1% |
| (Missing) | 2740947 |
Length
| Value | Count | Frequency (%) |
| w | 279070 | |
| y | 30473 | 8.1% |
| n | 30039 | 8.0% |
| u | 14026 | 3.7% |
| k | 13080 | 3.5% |
| e | 9169 | 2.4% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (14) | 61 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 279070 | |
| y | 30473 | 8.1% |
| n | 30039 | 8.0% |
| u | 14026 | 3.7% |
| k | 13080 | 3.5% |
| e | 9169 | 2.4% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (18) | 70 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 376007 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| w | 279070 | |
| y | 30473 | 8.1% |
| n | 30039 | 8.0% |
| u | 14026 | 3.7% |
| k | 13080 | 3.5% |
| e | 9169 | 2.4% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (18) | 70 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 376007 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| w | 279070 | |
| y | 30473 | 8.1% |
| n | 30039 | 8.0% |
| u | 14026 | 3.7% |
| k | 13080 | 3.5% |
| e | 9169 | 2.4% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (18) | 70 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 376007 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| w | 279070 | |
| y | 30473 | 8.1% |
| n | 30039 | 8.0% |
| u | 14026 | 3.7% |
| k | 13080 | 3.5% |
| e | 9169 | 2.4% |
| g | 30 | < 0.1% |
| p | 23 | < 0.1% |
| r | 14 | < 0.1% |
| o | 13 | < 0.1% |
| Other values (18) | 70 | < 0.1% |
has-ring
Categorical
IMBALANCE 
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
| f | |
|---|---|
| t | |
| r | 16 |
| h | 13 |
| c | 11 |
| Other values (18) | 79 |
Length
| Max length | 10 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000038 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3116933 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | f |
|---|---|
| 2nd row | t |
| 3rd row | f |
| 4th row | f |
| 5th row | f |
Common Values
| Value | Count | Frequency (%) |
| f | 2368820 | |
| t | 747982 | 24.0% |
| r | 16 | < 0.1% |
| h | 13 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| s | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 8 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (13) | 32 | < 0.1% |
| (Missing) | 24 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| f | 2368821 | |
| t | 747982 | 24.0% |
| r | 16 | < 0.1% |
| h | 13 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| s | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 8 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (13) | 32 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2368821 | |
| t | 747982 | 24.0% |
| r | 17 | < 0.1% |
| h | 14 | < 0.1% |
| s | 12 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 9 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (17) | 39 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3116933 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| f | 2368821 | |
| t | 747982 | 24.0% |
| r | 17 | < 0.1% |
| h | 14 | < 0.1% |
| s | 12 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 9 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (17) | 39 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3116933 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| f | 2368821 | |
| t | 747982 | 24.0% |
| r | 17 | < 0.1% |
| h | 14 | < 0.1% |
| s | 12 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 9 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (17) | 39 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3116933 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| f | 2368821 | |
| t | 747982 | 24.0% |
| r | 17 | < 0.1% |
| h | 14 | < 0.1% |
| s | 12 | < 0.1% |
| c | 11 | < 0.1% |
| l | 11 | < 0.1% |
| p | 11 | < 0.1% |
| g | 9 | < 0.1% |
| z | 6 | < 0.1% |
| Other values (17) | 39 | < 0.1% |
ring-type
Categorical
IMBALANCE  MISSING 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 128880 |
| Missing (%) | 4.1% |
| Memory size | 149.4 MiB |
| f | |
|---|---|
| e | 120006 |
| z | 113780 |
| l | 73443 |
| r | 67909 |
| Other values (35) | 135757 |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000472 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2988206 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | f |
|---|---|
| 2nd row | z |
| 3rd row | f |
| 4th row | f |
| 5th row | f |
Common Values
| Value | Count | Frequency (%) |
| f | 2477170 | |
| e | 120006 | 3.9% |
| z | 113780 | 3.7% |
| l | 73443 | 2.4% |
| r | 67909 | 2.2% |
| p | 67678 | 2.2% |
| g | 63687 | 2.0% |
| m | 3992 | 0.1% |
| t | 98 | < 0.1% |
| d | 37 | < 0.1% |
| Other values (30) | 265 | < 0.1% |
| (Missing) | 128880 | 4.1% |
Length
| Value | Count | Frequency (%) |
| f | 2477173 | |
| e | 120006 | 4.0% |
| z | 113780 | 3.8% |
| l | 73443 | 2.5% |
| r | 67909 | 2.3% |
| p | 67678 | 2.3% |
| g | 63687 | 2.1% |
| m | 3992 | 0.1% |
| t | 98 | < 0.1% |
| d | 37 | < 0.1% |
| Other values (30) | 265 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2477173 | |
| e | 120024 | 4.0% |
| z | 113780 | 3.8% |
| l | 73446 | 2.5% |
| r | 67921 | 2.3% |
| p | 67688 | 2.3% |
| g | 63694 | 2.1% |
| m | 3992 | 0.1% |
| t | 106 | < 0.1% |
| n | 45 | < 0.1% |
| Other values (24) | 337 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2988206 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| f | 2477173 | |
| e | 120024 | 4.0% |
| z | 113780 | 3.8% |
| l | 73446 | 2.5% |
| r | 67921 | 2.3% |
| p | 67688 | 2.3% |
| g | 63694 | 2.1% |
| m | 3992 | 0.1% |
| t | 106 | < 0.1% |
| n | 45 | < 0.1% |
| Other values (24) | 337 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2988206 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| f | 2477173 | |
| e | 120024 | 4.0% |
| z | 113780 | 3.8% |
| l | 73446 | 2.5% |
| r | 67921 | 2.3% |
| p | 67688 | 2.3% |
| g | 63694 | 2.1% |
| m | 3992 | 0.1% |
| t | 106 | < 0.1% |
| n | 45 | < 0.1% |
| Other values (24) | 337 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2988206 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| f | 2477173 | |
| e | 120024 | 4.0% |
| z | 113780 | 3.8% |
| l | 73446 | 2.5% |
| r | 67921 | 2.3% |
| p | 67688 | 2.3% |
| g | 63694 | 2.1% |
| m | 3992 | 0.1% |
| t | 106 | < 0.1% |
| n | 45 | < 0.1% |
| Other values (24) | 337 | < 0.1% |
spore-print-color
Categorical
IMBALANCE  MISSING 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2849682 |
| Missing (%) | 91.4% |
| Memory size | 164.9 MiB |
| k | |
|---|---|
| p | |
| w | |
| n | |
| r | 7975 |
| Other values (27) |
Length
| Max length | 10 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001983 |
| Min length | 1 |
Characters and Unicode
| Total characters | 267316 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | k |
|---|---|
| 2nd row | w |
| 3rd row | k |
| 4th row | k |
| 5th row | p |
Common Values
| Value | Count | Frequency (%) |
| k | 107310 | 3.4% |
| p | 68237 | 2.2% |
| w | 50173 | 1.6% |
| n | 22646 | 0.7% |
| r | 7975 | 0.3% |
| u | 7256 | 0.2% |
| g | 3492 | 0.1% |
| y | 36 | < 0.1% |
| s | 21 | < 0.1% |
| c | 16 | < 0.1% |
| Other values (22) | 101 | < 0.1% |
| (Missing) | 2849682 |
Length
| Value | Count | Frequency (%) |
| k | 107310 | |
| p | 68237 | |
| w | 50173 | |
| n | 22646 | 8.5% |
| r | 7975 | 3.0% |
| u | 7256 | 2.7% |
| g | 3492 | 1.3% |
| y | 36 | < 0.1% |
| s | 21 | < 0.1% |
| c | 16 | < 0.1% |
| Other values (23) | 103 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| k | 107310 | |
| p | 68237 | |
| w | 50173 | |
| n | 22649 | 8.5% |
| r | 7977 | 3.0% |
| u | 7256 | 2.7% |
| g | 3492 | 1.3% |
| y | 36 | < 0.1% |
| s | 25 | < 0.1% |
| c | 19 | < 0.1% |
| Other values (26) | 142 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 267316 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| k | 107310 | |
| p | 68237 | |
| w | 50173 | |
| n | 22649 | 8.5% |
| r | 7977 | 3.0% |
| u | 7256 | 2.7% |
| g | 3492 | 1.3% |
| y | 36 | < 0.1% |
| s | 25 | < 0.1% |
| c | 19 | < 0.1% |
| Other values (26) | 142 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 267316 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| k | 107310 | |
| p | 68237 | |
| w | 50173 | |
| n | 22649 | 8.5% |
| r | 7977 | 3.0% |
| u | 7256 | 2.7% |
| g | 3492 | 1.3% |
| y | 36 | < 0.1% |
| s | 25 | < 0.1% |
| c | 19 | < 0.1% |
| Other values (26) | 142 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 267316 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| k | 107310 | |
| p | 68237 | |
| w | 50173 | |
| n | 22649 | 8.5% |
| r | 7977 | 3.0% |
| u | 7256 | 2.7% |
| g | 3492 | 1.3% |
| y | 36 | < 0.1% |
| s | 25 | < 0.1% |
| c | 19 | < 0.1% |
| Other values (26) | 142 | 0.1% |
habitat
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 45 |
| Missing (%) | < 0.1% |
| Memory size | 148.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000677 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3117111 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | d |
|---|---|
| 2nd row | d |
| 3rd row | l |
| 4th row | d |
| 5th row | g |
| Value | Count | Frequency (%) |
| d | 2177573 | |
| g | 454908 | 14.6% |
| l | 171892 | 5.5% |
| m | 150969 | 4.8% |
| h | 120138 | 3.9% |
| w | 18531 | 0.6% |
| p | 17180 | 0.6% |
| u | 5264 | 0.2% |
| e | 55 | < 0.1% |
| s | 52 | < 0.1% |
| Other values (41) | 340 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 2177576 | |
| g | 454910 | 14.6% |
| l | 171900 | 5.5% |
| m | 150970 | 4.8% |
| h | 120143 | 3.9% |
| w | 18531 | 0.6% |
| p | 17190 | 0.6% |
| u | 5265 | 0.2% |
| e | 68 | < 0.1% |
| s | 65 | < 0.1% |
| Other values (27) | 493 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3117111 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 2177576 | |
| g | 454910 | 14.6% |
| l | 171900 | 5.5% |
| m | 150970 | 4.8% |
| h | 120143 | 3.9% |
| w | 18531 | 0.6% |
| p | 17190 | 0.6% |
| u | 5265 | 0.2% |
| e | 68 | < 0.1% |
| s | 65 | < 0.1% |
| Other values (27) | 493 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3117111 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 2177576 | |
| g | 454910 | 14.6% |
| l | 171900 | 5.5% |
| m | 150970 | 4.8% |
| h | 120143 | 3.9% |
| w | 18531 | 0.6% |
| p | 17190 | 0.6% |
| u | 5265 | 0.2% |
| e | 68 | < 0.1% |
| s | 65 | < 0.1% |
| Other values (27) | 493 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3117111 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 2177576 | |
| g | 454910 | 14.6% |
| l | 171900 | 5.5% |
| m | 150970 | 4.8% |
| h | 120143 | 3.9% |
| w | 18531 | 0.6% |
| p | 17190 | 0.6% |
| u | 5265 | 0.2% |
| e | 68 | < 0.1% |
| s | 65 | < 0.1% |
| Other values (27) | 493 | < 0.1% |
season
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 148.6 MiB |
| a | |
|---|---|
| u | |
| w | |
| s | 141847 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3116945 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a |
|---|---|
| 2nd row | w |
| 3rd row | w |
| 4th row | u |
| 5th row | a |
Common Values
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3116945 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1543321 | |
| u | 1153588 | |
| w | 278189 | 8.9% |
| s | 141847 | 4.6% |
| cap-diameter | class | does-bruise-or-bleed | gill-spacing | has-ring | id | ring-type | season | spore-print-color | stem-height | stem-root | stem-width | veil-color | veil-type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cap-diameter | 1.000 | 0.158 | 0.110 | 0.064 | 0.050 | 0.000 | 0.094 | 0.091 | 0.275 | 0.512 | 0.337 | 0.883 | 0.113 | 0.000 |
| class | 0.158 | 1.000 | 0.038 | 0.140 | 0.050 | 0.000 | 0.197 | 0.149 | 0.426 | 0.073 | 0.521 | 0.218 | 0.496 | 0.003 |
| does-bruise-or-bleed | 0.110 | 0.038 | 1.000 | 0.035 | 0.009 | 0.000 | 0.043 | 0.092 | 0.085 | 0.044 | 0.109 | 0.106 | 0.178 | 0.000 |
| gill-spacing | 0.064 | 0.140 | 0.035 | 1.000 | 0.048 | 0.001 | 0.044 | 0.155 | 0.405 | 0.030 | 0.197 | 0.088 | 0.115 | 0.167 |
| has-ring | 0.050 | 0.050 | 0.009 | 0.048 | 1.000 | 0.000 | 0.194 | 0.023 | 0.177 | 0.105 | 0.085 | 0.079 | 0.142 | 0.000 |
| id | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 |
| ring-type | 0.094 | 0.197 | 0.043 | 0.044 | 0.194 | 0.000 | 1.000 | 0.070 | 0.261 | 0.227 | 0.142 | 0.121 | 0.180 | 0.072 |
| season | 0.091 | 0.149 | 0.092 | 0.155 | 0.023 | 0.000 | 0.070 | 1.000 | 0.213 | 0.051 | 0.147 | 0.073 | 0.147 | 0.000 |
| spore-print-color | 0.275 | 0.426 | 0.085 | 0.405 | 0.177 | 0.000 | 0.261 | 0.213 | 1.000 | 0.095 | 0.344 | 0.360 | 0.271 | 0.152 |
| stem-height | 0.512 | 0.073 | 0.044 | 0.030 | 0.105 | 0.000 | 0.227 | 0.051 | 0.095 | 1.000 | 0.246 | 0.449 | 0.195 | 0.006 |
| stem-root | 0.337 | 0.521 | 0.109 | 0.197 | 0.085 | 0.000 | 0.142 | 0.147 | 0.344 | 0.246 | 1.000 | 0.237 | 0.327 | 0.080 |
| stem-width | 0.883 | 0.218 | 0.106 | 0.088 | 0.079 | 0.000 | 0.121 | 0.073 | 0.360 | 0.449 | 0.237 | 1.000 | 0.211 | 0.010 |
| veil-color | 0.113 | 0.496 | 0.178 | 0.115 | 0.142 | 0.000 | 0.180 | 0.147 | 0.271 | 0.195 | 0.327 | 0.211 | 1.000 | 0.390 |
| veil-type | 0.000 | 0.003 | 0.000 | 0.167 | 0.000 | 0.002 | 0.072 | 0.000 | 0.152 | 0.006 | 0.080 | 0.010 | 0.390 | 1.000 |